Making Full Use of Chinese Speech Corpora

Author

  • Thomas Fang Zheng
Abstract

It is well understood that speech databases play a very important role in speech recognition. Speech recognition researchers hope to create more useful databases with less effort. To achieve this goal, a database should first be well designed, and tools and additional information should be provided so that it can be used to the full. This paper illustrates the criteria according to which Chinese speech databases are created for different purposes. Transcription, which is the first step after data creation, is also discussed. Examples are then given of how to learn knowledge from the created database for other research purposes.

1 Purpose of Speech Corpora

Speech corpora play a very important role in speech and language processing, and the speech community has been aware of this for years. To make full use of speech corpora efficiently, people in the speech community worldwide have established consortia such as LDC and ELRA. The purposes of speech corpora are summarized in Table 1 (Kuwabara 2002).

Table 1. Purpose of Speech Corpora

Item                            Description                                                                                                  Percentage
1. Speech/speaker recognition   system development, evaluation, sentence comprehension and summarization, speech recognition, speaker recognition   73%
2. Speech synthesis             system development, prosodic analysis                                                                        11%
3. Acoustic analysis            acoustic analysis, speech coding                                                                             9%
4. Sentence analysis            syntactic and semantic analysis                                                                              5%
5. Speech/language education    speech and language education                                                                                2%

Similar resources

Towards Highly Usable and Robust Spoken Language Technologies for Chinese

This paper gives an overview of our research on Chinese spoken language technologies during the past ten years. It covers fundamental acoustic-phonetic studies of spoken Cantonese, speech corpora development, automatic speech recognition and text-to-speech. Currently our focus is on making these technologies more usable for general users who are not speech experts, and more robust for real-worl...

Construction of Chinese Segmented and POS-tagged Conversational Corpora and Their Evaluations on Spontaneous Speech Recognitions

The performance of a corpus-based language and speech processing system depends heavily on the quantity and quality of the training corpora. Although several famous Chinese corpora have been developed, most of them are mainly written text. Even for some existing corpora that contain spoken data, the quantity is insufficient and the domain is limited. In this paper, we describe the development o...

Chinese Part-of-Speech Tagging: One-at-a-Time or All-at-Once? Word-Based or Character-Based?

Chinese part-of-speech (POS) tagging assigns one POS tag to each word in a Chinese sentence. However, since words are not demarcated in a Chinese sentence, Chinese POS tagging requires word segmentation as a prerequisite. We could perform Chinese POS tagging strictly after word segmentation (one-at-a-time approach), or perform both word segmentation and POS tagging in a combined, single step si...

Speech Recognition and Information Retrieval: Experiments in Retrieving Spoken Documents

The Informedia Digital Video Library Project at Carnegie Mellon University is making large corpora of video and audio data available for full content retrieval by integrating natural language understanding, image processing, speech recognition and information retrieval. Information retrieval from corpora of speech recognition output is critical to the project’s success. In this paper, we out...

Spoken language resources for Cantonese speech processing

This paper describes the development of CU Corpora, a series of large-scale speech corpora for Cantonese. Cantonese is the most commonly spoken Chinese dialect in Southern China and Hong Kong. CU Corpora are the first of their kind and intended to serve as an important infrastructure for the advancement of speech recognition and synthesis technologies for this widely used Chinese dialect. They ...

Journal title:
  • Journal of Chinese Language and Computing

Volume 14

Publication date: 2004